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(54) Wan-based voice gateway 



(57) In one embodiment of the invention, an appa- 
ratus is provided for establishing a communication ses- 
sion between first and second terminals in communica- 
tion over a plurality of networks that employ differing 
transmission standards. The plurality of networks are 
selected from among a circuit switched network (e.g., a 
telephony network), a connectionless packet switched 
network (e.g., the Internet) and a connection-oriented 
packet switched network (e.g., an ATM or frame relay 
network). The apparatus includes a call set-up translator 
for translating among call set-up protocols associated 
with the circuit switched network, the connectionless 
packet switched network and the connection-oriented 



packet switched network. An encoding format translator 
is provided for translating among encoding protocols as- 
sociated with the circuit switched network, the connec- 
tionless packet switched network and the connection- 
oriented packet switched network. Also provided is an 
address database for storing a plurality of addresses in 
different formats for each registered terminal, which in- 
cludes the first and second terminals. The apparatus al- 
so includes a session manager for storing control infor- 
mation relating to the first and second terminals. The 
control information includes an identification of the first 
and second terminals that participate in the communi- 
cation session. 
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Description 

Technlcai Field 

This invention relates generally to an apparatus for 
establishing comnriunications paths over a circuit 
switched network, a connectionless packet switched 
network, and a connection-oriented packet switched 
network, and more particularly to an apparatus for es- 
tablishing point-to-point or point-to-multipoint audio or 
video communication over a telephony network, the In- 
ternet, and an Asynchronous Transfer Mode (ATM) or a 
Frame Relay (FR) network. 

Background of the invention 

Voice traffic transmitted between two or more users 
over a telephony network is carried over circuit-switched 
paths that are established between the users. Circuit- 
switched technology is well-suited for delay-sensitive, 
real-time applications such as voice transmission since 
a dedicated path is established. In a circuit-switched 
network, all the bandwidth of the established path Is al- 
located to the voice traffic for the duration of the call. 

In contrast to the telephony network, the Internet is 
an example of a connectionless packet -switched net- 
work that is based on the Internet Protocol (IP), While 
the majority of the traffic carried over the telephony net- 
work is voice traffic, the Internet is more suitable to de- 
lay-insensitive applications such as the transmission of 
data. The Intemet community has been exploring im- 
provements in IP so that voice can be carried over IP 
packets without significant performance degradation. 
For example, the resource reservation protocol known 
as RSVP (see RSVP Version 2 Functional Specifica- 
tions, R. Braden, L.Zhang, D. Estrin, Intemet Draft, 
<draft-ietf-rsvp-spec-06, 1 996) provides a technique for 
reserving resources (i.e. bandwidth) for the transmis- 
sion of unicast and multicast data with good scaling and 
robustness properties. The resen/ed bandwidth is used 
to effectively simulate the dedicated bandwidth scheme 
of circuit-switched networks to transmit delay-sensitive 
traffic. If RSVP is implemented only for those communi- 
cations having special Quality of Service (QoS) needs 
such as minimal delay, the transmission of other com- 
munications such as non-real time data packets may be 
provided to other users of the Internet in the usual best- 
effort, packet-switched manner. 

The majority of Internet users currently access the 
Intemet via slow-speed dial/modem lines using proto- 
cols such as SLI P (serial line I P) and PPP (Point to Point 
Protocol), which run over serial telephone lines (modem 
and N-ISDN) and carry IP packets. Voice signals are 
packetized by an audio codec on the user's multimedia 
PC. The voice packets carry substantial packetization 
overhead including the headers of PPP, IP, UDP, and 
RTP, which can be as big as 40 octets. Transmitting 
voice packets over low speed access lines is almost im- 



possible because of the size of the header relative to 
the size of a typical voice packet (20-160 octets, based 
on the average acceptable voice delay and amount of 
voice compression). However, several proposals have 
5 emerged to compress the voice packet headers so that 
greater transmission efficiency and latency can be 
achieved for voice-packets transmitted over low-speed, 
dial access lines. 

A substantial number of users are expected to begin 
10 sending voice traffic over the Intemet with acceptable 
voice quality and latency because of the availability of 
RSVP and packet-header compression technologies. 
The transmission terminals for sending packetized 
voice over the Internet are likely to be multimedia per- 
is sonal computers. 

in addition to the telephony network and the Inter- 
net, other transmission standards such as Frame-Relay 
and ATM have been emerging as alternative transport 
technologies for integrated voice and data. ATM/FR net- 
20 works are similar to the telephony network in that they 
both employ connection-oriented technology However, 
unlike the telephony network, ATM/FR networks employ 
packet switching. In contrast to the Internet Protocol, 
which is a network layer protocol (layer three), FR and 
2S ATM pertain to the data link layer (layer two) of the sev- 
en-layer OSl model. 

Frame Relay and ATM can transport voice in two 
different formats within the FR (or ATM) packets (cells). 
In the first format, the FR (ATM) packets (cells) carry an 
30 IP packet (or some other layer 3 packet ), which in turn 
encapsulates the voice packets. Alternatively, the FR 
(ATM) packets (cells) directly encapsulate the voice 
packet, i.e., without using IP encapsulation. The first al- 
ternative employs protocols such as L^N Emulation 
35 (LANE), Classical IP Over ATM, and Multiprotocol Over 
ATM (MPOA), all of which are well known in the prior 
art. The second alternative is referred to as "Voice over 
FR" and "Voice over ATM', respectively. Note that the 
first alternative, which includes IP encapsulation, allows 
40 voice packets to be routed between IP routers. That is, 
layer three processing is performed by the routers along 
the voice path to determine the next hop router The sec- 
ond alternative is a purely FR/ATM switched solution. In 
other words, switching can be performed only at the data 
45 link layer. FIG, 1 depicts the protocol stacks for transport 
of voice over IP and the two alternatives for voice over 
FR/ATM. 

The audio codec depicted in FIG. 1 enables voice 
encoding/decoding, including voice digitization, com- 
50 pression, silence elimination and formatting. The audio 
codec is defined by ITU-T standards such as G.711 ( 
PCM of Voice Frequencies), G.722 (7 Khz Audio-Cod- 
ing within 64 Kbps), G.723 (Dual Rate Speech Code for 
Multimedia Telecommunications Transmitting at 6.4 and 
55 5.3 Kbps), and G.728 (Speech Encoding at 16 Kbps). 
The "Voice over ATM/FR layer" depicted in FIG. 1 
is referred to as the multimedia multiplex and synchro- 
nization layer, an example of which is defined in ITU-T 
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standard H.222. ITU-T is currently defining the H.323 
standard, which specifies point-to-point and multipoint 
audio-visual communications between terminals (such 
as PCs) attached to LANs. This standard defines the 
components of an H.323 system including H.323 termi- 
nals, gate-keepers, and multi-point control units 
(MCUs). PCs in communication with the Internet can 
use the H.323 standards to communicate with each oth- 
er on the same LAN or across routed data networks. In 
addition to H.323, the ITU-T is in the process of defining 
similar audio-visual component standards for B-ISDN 
(ATM) in the H.310 standard , and for N-ISDN in the H. 
320 standard. The previously mentioned standards also 
define call signaling formats. For example, IP networks 
use Q.931 call controls over a new ITU-T standard 
known as H.225 (for H.323 terminals). Telephony net- 
works use Q.931 signaling and ATM networks use Q. 
2931 signaling. 

Many standards bodies are in the process of defin- 
ing how voice (and video) can be transported within a 
given homogenous network such as the telephony, IP, 
FR and ATM networks. However, there Is currently no 
arrangement for transmitting voice over a heterogene- 
ous network that consists of two or more such networks 
employing different transmission standards. 

Summary of the Invention 

In accordance with the principles of the invention, 
the foregoing problem can be addressed by employing 
a WAN -based Voice Gateway which connects to the te- 
lephony network, the Internet and the ATM/FR network. 
Given that network users will be in communication with 
a variety of such heterogeneous networks, gateway ca- 
pabilities will be needed between them to support end- 
point stations in a voice session which are connected to 
one or more of these different networks. The telephony 
network, Internet and FR/ATM Networks all use different 
schemes for establishing a voice session (i.e., call set- 
up protocols), and different formats for controlling a ses- 
sion and transporting voice. The gateway of the present 
invention provides conversion of the transmission for- 
mat, control, call signaling and audio stream (and po- 
tentially video and data streams) between different 
transmission standards. The capabilities of the gateway 
may also include audio coding translation (e.g. , between 
G.722 and G.728) and address translation between dif- 
ferent address types (e.g., a telephone number and an 
IP address). 

In some embodiments of the invention the voice 
gateway 100 performs the following functions: call-sig- 
naling protocol conversion (e.g., between Q.931, Q. 
2931, H.225); audio mixing/bridging or generation of 
composite audio and switching; address registration; 
address translation (e.g., IP <-> E.I 64 <•> NSAP<-> 
email); audio format conversion (e.g., from G.711 to G. 
728); session management/control (e.g. manage 
number of end points in a call); interfacing with other 



gateways (e.g. WAN-to-WAN or WAN-to-local); interfac- 
ing with the SS7 signaling network; and interfacing with 
the Internet signaling network. 

In one embodiment of the invention, an apparatus 
5 is provided for establishing a communication session 
between first and second terminals in communication 
over a plurality of networks that employ differing trans- 
mission standards. The communication session may be 
an audio session, a video session or a multimedia ses- 
'io sion. The plurality of networks are selected from among 
a circuit switched network (e.g., a telephony network), 
a connectionless packet switched network (e.g., the In- 
ternet) and a connection-oriented packet switched net- 
work (e.g., an ATM or frame relay network). The appa- 
'5 ratus includes a call set-up translator for translating 
among call set-up protocols associated with the circuit 
switched network, the connectionless packet switched 
network and the connection -oriented packet switched 
network. An encoding format translator is provided for 
^0 translating among encoding protocols associated with 
the circuit switched network, the connectionless packet 
switched network and the connection -oriented packet 
switched network. Also provided Is an address database 
for storing a plurality of addresses in different formats 
25 for each registered terminal, which includes the first and 
second terminals. The apparatus also includes a ses- 
sion manager for storing control information relating to 
the first and second terminals. The control information 
includes an identification of the first and second termi- 
te nals that participate in the communication session. 

Brief Description of the Drawing 

In the drawings: 

35 

FIG. 1 shows a simplified protocol stack for trans- 
porting voice over an IP network, a telephony network 
(e.g., an ISDN network) and an ATM/FR network. 
FIG. 2 shows a voice gateway in accordance with 
<o the present invention situated among a telephony net- 
work, an IP network and a AM/FR network. 

FIG. 3 shows a plurality of voice gateways interfac- 
ing with one another and with user terminals. 

FIG. 4 shows a simplified diagram of a voice gate- 
45 way interconnected with various networks. 

FIG. 5 is a block diagram showing the functionality 
of various interfaces of which the voice gateway is com- 
prised. 

FIG. 6 shows a flow chart of an exemplary method 
50 for processing calls through the voice gateway in ac- 
cordance with the present invention. 

FIG. 7 is a block diagram of one embodiment of the 
voice gateway shown in FIGS. 2-4. 

55 Detailed Description 

FIG. 2 shows a voice gateway 100 in accordance 
with the present invention. As shown, the gateway 100 
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communicates with networks employing differing trans- 
mission standards such as telephony network 52, ATM/ 
FR network 57 and Internet 53. The gateway 1 00 is con- 
nected to a switch, router or sender, and an ATM/FR 
switch, which are located in the telephony network 52, 5 
the Internet 53, and the ATM/FR network 57, respective- 
ly. The gateway 100 facilitates voice communication be- 
tween a variety of end-point stations connected to the 
individual networks. Such stations may include tele- 
phone 61 , fax machine/telephone 62, and PC 63 (which io 
are connected tothe telephony network 52), PCs 71 and 
72 (which are connected to the Internet 53) and work- 
stations 81 and 82 (which are connected to the ATM/FR 
network 57). Voice gateway 100 will be deployed in a 
distributed fashion. That is, various gateways can be in- '5 
terconnected and used in a tandem manner to complete 
calls. 

The voice gateway 100 includes an interface to 
each of the networks 52, 53, and 57. These interfaces, 
depicted in functional form in FIG. 4, will be described 20 
below in additional detail. In general, the interfaces 
serve to convert and manage call signaling among the 
different networks and to mix voice calls received from 
within a given network. 

As shown in FIG. 3, voice gateway 1 00 may also be 2S 
in direct communication with other voice gateways 102 
and 103 in the WAN and local voice gateways 105 and 
107 attached to the customer's LAN, local ATM/FR net- 
works, or voice terminals. A WAN voice gateway serves 
as a "master" gateway with respect to local gateways 30 
directly attached thereto. In this configuration the local 
gateways serve as so-called "slaves." When two WAN 
voice gateways such as gateways 100 and 102 are in 
direct communication they may act as "peers" with re- 
spect to one another while each one also functions as 35 
a "master" to the local gateway to which its in direct com- 
munication. In an alternative configuration, the WAN 
voice gateways in direct communication with one anoth- 
er may be arranged in a hierarchical manner in such a 
way that each WAN voice gateway is connected to an- 40 
other WAN voice gateway that serves as Its "master" 
gateway. A "peer" configuration solution is generally 
more suitable when the WAN voice gateways are ar- 
ranged in a mesh-connected topology while the "mas- 
ter" (hierarchical) configuration is generally more suita- 
ble for a tree-connected topology. FIG. 3 depicts a 
mesh -connected topology in which each WAN voice 
gateway 100, 102, and 103 is a peer of its neighboring 
WAN gateways. 

As seen in FIG. 4, an IP call set-up interface 101 is so 
provided for receiving and terminating call-setup re- 
quests from the Internet and for generating call-set up 
requests to the Internet to establish connections be- 
tween two or more Internet stations, telephony stations, 
frame relay stations, and/or ATM stations. Interface 101 ss 
sends and receives call setup requests in the form of IP 
packets using signaling protocols such as Q.931 (or a 
sub-set of Q.931 as defined in H.323) or another sign- 



aling protocol that may be developed particularly for 
transmitting voice over IP. The IP call set-up interface 
101 receives call-setup requests from the telephony call 
set-up interface 102 (discussed below) in the form of 
DTMR Q.931 or other signaling standards. The inter- 
face 101 also receives call-setup requests from the 
ATM/FR call set-up interface 103 (discussed below) if 
the call-setup request is in the form of Q.2931 . A sign- 
aling format translator 104 is provided to translate the 
call-setup requests into a form that the interface 101 can 
properly understand. The translation is performed be- 
fore the requests are fon/varded to the IP call set-up in- 
terface 101, The interface 101 monitors the status of 
each call establishment session and transmits error 
messages, as appropriate, in the form of audio messag- 
es or digital data to each IP station participating in the 
session. 

The gateway 1 00 also includes a telephony call set- 
up interface 102 for receiving call-setup requests from 
the telephony network 52 or sending call-setup requests 
to the telephony network 52 to establish connections be- 
tween two or more Internet stations, telephony stations, 
frame relay stations and/or ATM stations. Telephony 
set-up interface 1 02 receives and sends call setup mes- 
sages in accordance with Q.931 or with other telephony 
signaling protocols. The interface 102 also generates 
SS7 signaling messages to a Network Control Point 
(NCP) to obtain, for example, a telephone number trans- 
lation prior to generating an outgoing Q.931 signaling 
message to the telephony network 52. Additionally, te- 
lephony call set-up interface 102 receives call-setup re- 
quests from the IP call set-up interface 1 01 and the ATM/ 
FR call set-up interface 1 03 (discussed below) if the call- 
setup request originates in one these networks. The sig- 
naling format translator 104 translates the call-set up in- 
to a fonn that is understood by the telephony call set-up 
interface 102. 

Similar to the interfaces 101 and 102 discussed 
above, an ATM/FR call set-up interface 103 Is provided 
for receiving call-setup requests from the ATM/FR net- 
work 57 and for transmitting call-set up requests to the 
ATM/FR network 57 to establish connections between 
two or more Internet stations, telephony stations, FR 
stations, and/or ATM stations. ATM/FR call set-up inter- 
face 103 sends and receives call setup requests in the 
form of packets employing FR/ATM signaling protocols. 
The ATM/FR call set-up interface 1 03 also receives call- 
setup requests from the telephony call set-up interface 
102, and the IP call set-up interface 101 if the call-setup 
request originates in the telephony or IP networks, re- 
spectively. The signaling format translator 104 trans- 
lates these requests into a form of that is understood by 
the ATM/FR call set-up interface 103. 

The gateway 1 00 further includes an IP packet mix- 
er 201 . The IP packet mixer 201 receives voice in the 
form of IP packet streams from one or more IP stations 
(including voice terminals or other voice gateways) and 
processes each incoming stream (e.g., by multiplexing 



4 



7 



EP 0 841 831 A2 



8 



the various voice streams onto a single IP packet 
stream). The IP mixer 201 also performs appropriate 
voice encoding translation into a format compatible with 
the voice decoding capabilities of each receiving station 
as identified by the session manager 304. The IP mixer 

201 subsequently transmits the IP packets to the other 
IP stations participating in the session. If there are sta- 
tions participating in the communication session which 
are not IP stations, (as identified by the session manag- 
er 304), the IP packet mixer 201 sends those packets 
received from the stations to the format translator 204, 
which then de-encapsulates and converts the IP pack- 
ets into a format appropriate for the telephony bridge 

202 and/or ATM/FR mixer 203. 

In some embodiments of the invention, the IP pack- 
et mixer 201 also provides control functionality that 
would othenwise be performed by the IP call set-up in- 
terface 101. In particular, the IP packet mixer 201 per- 
forms such control functions when in-band signaling is 
employed. If out-of-band signaling is employed, the con- 
trol functions may conveniently reside in the IP call set- 
up interface 101. In the former situation the IP packet 
mixer receives control packets over an IP connection 
such as a dedicated UDP or TCP socket interface, for 
example. The control packets identify the control infor- 
mation pertaining to the station from which it receives 
the packet, such as the type of voice encoding that is 
employed by the station, bandwidth utilization, andQoS 
requirements. Of course, if no control information Is pro- 
vided, previously defined default control parameters 
may be used. The IP packet mixer 201 is also used by 
an IP station to terminate its participation in a session. 
The session control information received by the IP pack- 
et mixer 201 is fon/varded to the session manager 304 
to maintain a current database of station requirements. 

The telephony bridge 202 is the mirror image of the 
IP packet mixer 201 . The bridge 202 bridges (mixes and 
switches) voice calls received from a plurality of teleph- 
ony network stations during a voice session. If there are 
stations participating in the session which are not te- 
lephony stations, the bridge 202 sends the digital voice 
signals it receives from the telephony stations to the 
voice format interface 204 (discussed below), which 
performs echo cancellation, voice encoding, encryption 
and packetization before the digitized voice is sent to 
the IP mixer 201 and/or the ATM/FR mixer 203 for sub- 
sequent forwarding. Telephony bridge 202 also receives 
calls from voice terminals and other voice gateways. 

The ATM/FR mixer 203 is also the mirror image of 
the IP packet mixer 201. The mixer 203, which bridges 
voice calls received from a plurality of ATM/FR stations 
during a voice session, can mix a plurality of different 
voice streams onto ATM/FR cells/packets. If there are 
stations participating in the session which are not ATM/ 
FR stations, the mixer sends the ATM/FR voice packets 
it receives from the ATM/FR stations to the voice format 
Interface 204, which performs any appropriate de-en- 
capsulation, protocol conversion, packetization, etc.. 



before the digitized voice is sent to the IP packet mixer 
201 and/or the telephony bridge 202 for subsequent for- 
warding. 

The signaling format translator 104 is employed by 
5 the gateway 1 00 to convert and adapt among telephony 
signaling (Q.931), SS7 signaling, IP call signaling, and 
FR/ATM signaling protocols. For example, the interface 
104 receives signaling messages from the call set-up 
interface 101 and parses the message, performs appro- 
priate address translation using the address translator 
105, and translates the signaling format to another sig- 
naling format before sending it to the appropriate outgo- 
ing call signaling interface. 

The voice format interface is provided to convert 
J5 and adapt among the various telephony IP, FR and ATM 
voice formats, including voice encoding changes, echo 
cancellation, re-synchronization and packetization. 

An address translator 1 05 is also which allows var- 
ious stations to register using email address, IP ad- 
20 dress, E.I 64 address, MAC address and/or ATM NSAP. 
address formats. The interface can also translate ad- 
dresses from one address format to another. When mul- 
tiple gateways are employed, each "master" gateway 
may collect the address registrations stored in its "slave" 
25 gateways. The interface also maintains a list defining 
the correspondence between the station addresses di- 
rectly connected to the voice gateway 100. 

A session manager interface 304 is employed to re- 
ceive control information from the mixers, bridges and 
30 call set-up interfaces which pertains to the capabilities 
and status of those stations participating in the commu- 
nication session. The interface 304 assists the IP mixer 
201 , telephony bridge 202 and FR/ATM mixer 203 in for- 
warding voice traffic to all participating stations. 
55 As illustrated in FIG. 5 the voice gateway 100 also 
connects to various common Operations Administration 
Management and Provisioning (OAM&P) functions, da- 
tabases/directories (e.g., authentication databases 
such as for credit card authorization), and signaling net- 
40 work intelligence that reside within the SS7 signaling 
network such as a network control point (NCP) and an 
Internet NCP residing within the Internet. For example, 
an NCP may be used by the telephony call set-up inter- 
face 102 to translate an 800 number into a telephone 
45 number. Similarly, an Internet NCP may be used by the 
IP call set-up interface 101 to request a translation of a 
station's email address, host name, or URL to an IP The 
Internet NCPs provide intelligent services, such as dis- 
cussed in U.S. Application Serial No. 08/618,483. 
so FIG. 6 shows a flow chart of an exemplary method 
for establishing a voice session between the user sta- 
tions 300 and 600 of FIG. 3 in accordance with the prin- 
ciples of this invention. As seen in FIG. 3, station 300 is 
provided with direct connectivity to the Internet via voice 
55 gateway B. Station 600 communicates with the voice 
gateway C via an N-ISDN interface. In FIG. 3, the voice 
gateways A, B and C are all "peers" and any local gate- 
ways attached thereto serve as "slaves." 
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The method begins at step 501 when station 300 
sends a call signaling request over the Internet to voice 
gateway B in the form of an IP packet. The IP packet 
carries signaling information (e.g., in the form of a Q. 
931 message), including the IP address of the called 
station 600. In step 503, the IP call set-up Interface 101 
parses the I P packet and retrieves the IP address of sta- 
tion 600. In step 505, the IP call set-up interface 101 
sends an address query to the address translator 105 
to retrieve other addresses for station 600. In step 511 , 
the address translator 105 maps the IP address of sta- 
tion 600 to a toll-free 800 number. Thereafter, at the con- 
ditional branch point 51 3, address translator 105 deter- 
mines if the 800 number of station 600 is served by voice 
gateway B. 

If the result in step 513 is no, indicating that voice 
gateway C serves station 600, the method continues 
with step 523 in which the address translator 1 05 returns 
to gateway B to retrieve the IP address of Voice Gate- 
way C for contacting station 600. This step implies that 
the call to station 600 should be forwarded to voice gate- 
way C, which is the "master" gateway responsible for 
serving station 600. Thereafter, in step 503, the IP call 
set-up 101 interface of Voice Gateway B routes the call 
to the IP call set-up Interface 101 of Voice Gateway C 
for further processing. The method then continues as 
described below. 

If the result in step 51 3 is YES, indicating that voice 
gateway B serves the 800 number of station 600, the 
address translator interface 105 sends the 800 number 
to the IP call set-up Interface 101 of gateway B in step 
515. In step 517, the IP set-up interface 101 of gateway 
B sends the 800 number to the signaling format interface 
104, which in turn constructs an SS7 message and for- 
wards it to the telephony call set-up interface 102. In 
step 519, the interface 102 sends the SS7 message to 
the NCP in the signaling network to translate the 800 
number Into a telephone number. The NOP provides the 
requested telephone number to the telephony call set- 
up interface 102. Once the proper telephone number Is 
determined, interface 102 sends a Q.931 message to 
station 600 In step 521 to establish the call. 

Once the station 600 is connected, across the Te- 
lephony Bridge 202 and IP Mixer 201, a control plane 
connection is first established between the Users 300 
and 600 in step 601 (note that this connection traverses 
both Voice Gateway C and Voice Gateway B). This con- 
nection is employed by both stations in step 603 to in- 
dicate their respective audio encoding preferences, say 
G.711 for station 300 and G.723 for station 600. Note 
that voice gateway B needs to know the encoding pref- 
erences of station 300 while voice gateway C needs to 
know the encoding preferences of station 600. Addition- 
ally, note that the format translation between station 300 
and station 600 on the connection plane occurs in voice 
gateway C (given that communication from voice gate- 
way C to voice gateway B uses tP as the network layer 
protocol). Once the station capabilities and preferences 



are known to each voice gateway in step 605, the ses- 
sion managers 304 in both gateways B and C store a 
conference table that includes the preferences of both 
users. Communication proceeds between stations 300 
5 and 600 in step 611 when station 300 sends a voice 
packet to the IP mixer 201 in gateway B, which in turn 
sends the packet to the I P mixer 201 in voice gateway C. 

The method described above in connection with 
FIG. 6 may be implemented in a similar manner if station 
10 600 is an ISDN terminal that employs voice over ISDN 
without Implementing the Internet protocol. 

FIG. 7 Is a block diagram of an exemplary embod- 
iment of WAN-based voice gateway 1001 which in- 
cludes a) central processing unit (CPU) 1002, b) inter- 
15 face port 1003 c) data bus 1004 and d) memory 1 005. 
Central processing unit (CPU) 1002 provides the com- 
putational capability necessary to control the processes 
of voice gateway 1001 . Data bus 1004 provides for the 
exchange of data between the components of voice 
gateway 1001 . Interface port 1003 provides for the ex- 
change of data between voice gateway 1001 and devic- 
es external to Voice Gateway 1001 via link high speed 
backbone 425. To this end, interface port 1 003 contains, 
for example, well-known data transceivers. Memory 
1005 includes 1 ) code portion 1006, which contains the 
instructions (program) used by CPU 1002 to control the 
processes of Voice Gateway 1001, such as those de- 
scribed herein above, and data storage portion 1007, 
which contains the information necessary to the voice 
gateway to perform its specific function, such as, ad- 
dress registration and translation. 

The foregoing merely illustrates the principles of the 
invention. It will thus be appreciated that those skilled in 
the art will be able to devise various arrangements 
which, although not explicitly described or shown here- 
in, embody the principles of the invention and are thus 
within its spirit and scope. 



1. An apparatus for establishing a communication 
session between first and second terminals in com- 
munication over a plurality of networks employing 
differing transmission standards, said plurality of 
networks being selected from among a circuit 
switched network, a connectionless packet 
switched network and a connection-oriented packet 
switched network, comprising: 

a call set-up translator for translating among 
call set-up protocols associated with said circuit 
switched network, said connectionless packet 
switched network and said connection-oriented 
packet switched network; 
an encoding format translator for translating 
among encoding protocols associated with said 
circuit switched network, said connectionless 
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packet switched network and said connection- 
oriented packet switched network; 
an address database for storing a plurality of 
addresses in different formats for each regis- 
tered terminal including at least said first and 5 
second terminals; 

a session manager for storing control informa- 
tion relating to the first and second terminals, 
said control information including an identifica- 
tion of the first and second terminals participat- io 
ing in the communication session. 



mation further includes information defining a for- 
mat in which data is to be received by at least one 
of the first and second terminals. • 

15. The apparatus of claim 1 4 wherein said data format 
is alterable during the communication session. 

16. The apparatus of claim 1 wherein said call set-up 
translator translates among a plurality of standards, 
including H.225. Q.931 , Q.2931 , and SS7 signaling 
standards. 



2. The apparatus of claim 1 wherein said circuit- 
switched network is a telephony network. 

3. The apparatus of claim 1 wherein said connection- 
less packet switched network is the Internet. 

4. The apparatus of claim 1 wherein said connection- 
oriented packet switched network is an ATM net- 
work. 

5. The apparatus of claim 1 wherein said connection- 
oriented packet switched network is a Frame-Relay 
network. 

6. The apparatus of claim 1 wherein said communica- 
tion session is established among at least three ter- 
minals and further comprising an aggregator for 
bridging a plurality of communications received 
from a plurality of the terminals and for transmitting 
said plurality of communications to remaining ones 
of said at least three terminals. 

7. The apparatus of claim 1 wherein said communica- 
tion session is an audio session. 



a call set-up translator for translating among 
call set-up protocols associated with said circuit 

IS switched network, said connectionless packet 

switched network and said connection -oriented 
packet switched network; 
an encoding format translator for translating 
among encoding protocols associated with said 

20 circuit switched network, said connectionless 

packet switched network and said connection- 
oriented packet switched network; 
an address database for storing a plurality of 
addresses in different formats for each regis- 

25 tered terminal including at least said first and 

second terminals; 

a session manager for storing control informa- 
tion relating to the first and second terminals, 
said control Information including an identifica- 
30 tion of the first and second terminals participat- 

ing in the communication session. 



8. The apparatus of claim 1 wherein said communica- 
tion session includes video information. 

9. The apparatus of claim 1 wherein said communica- 
tion session is a multimedia session including audio 
and video information. 



1 0. The apparatus of claim 1 wherein said encoding for- 45 
mat translator is an audio format translator. 



1 1 . The apparatus of claim 1 wherein said encoding for- 
mat translator is a video format translator. 

50 

12. The apparatus of claim 1 wherein said encoding for- 
mat translator is a multimedia format translator 



13. The apparatus of claim 1 wherein said control infor- 
mation further includes quality of sen/ice require- 55 
ments. 



14. The apparatus of claim 1 wherein said control infor- 
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FIG. 6 



^501 



USER 300 SENDS A CALL REQUEST TO VOICE GATEWAY 100 
TO CONNECT TO USER 600 (USING USER 6Q0's IP ADDRESS) 



IP CALL SIGNAUNG 101 PARSES THE IP CALL SIGNALING 
MESSAGE. RETRIEVES THE IP ADDRESS OF U SER 600 



505 



IP CALL SIGNAUNG 101 SENDS AN ADDRESS QUERY TO ADDRESS 
TRANSLATOR 105 TO RETRIEVE OTHER ADDRESSES OF USER 600 



511 



ADDRESS TRANSLATOR 105 MAPS THE IP ADDRESS INTO AN 800 NUMBErY 



513 T 

W IS USER 600 SERVED \NOl SEND IP ADDRESS OF 
\ BY GATEWAY B /iTHE VOICE GATEWAY C 



C 



I YES 523- 

ADDRESS TRANSUTOR 105 SENDS THE 800 NUMBER >515 



TO THE IP CALL SIGNAUNG 101 
T 



517 
519 

521 
601 

603 

605 
611 



IP CALL SIGNAUNG 101 SENDS THE 800 NUMBER TO SIGNAUNG 

FORMAT ADAPTOR 104 WHICH IN TURN CONSTRUCTS AN SS7 
MESSAGE AND SENDS IT TO THE TELEPHONY CALL SIGNAUNG 102 



TELEPHONY CALL SIGNAUNG 102 SENDS THE SS7 MESSAGE TO THE 
NCP AND RECEIVES THE TELEPHONE NUMBER CORRESPONDING 

TO THE 800 NUMBER FROM THE NCP 

♦ 



TELEPHONY CALL SIGNAUNG 102 ESTABUSHES AN 
ISDN CONNECTION TO USER 600 

T 



USER 300 SENDS ITS SESSION CAPABIUHES TO VOICE GATEWAY B 

OVER THE NEW CONNECTION. 
VOICE GATEWAY B IN RETURN PROXIES THE USER 300 BY SENDING 
USER 300 CAPABIUTIES TO VOICE GATEWAY C. 



USER 600 SENDS ITS SESSION CAPABIUTIES TO VOICE GATEWAY C 

OVER THE NEW CONNECTION. 
VOICE GATEWAY C IN RETURN PROXIES THE USER 600 BY SENDING 
USER 600 CAPABIUTIES TO VOICE GATEWAY B. 



L. w . =V 



SESSION MANAGER 304 IN BOTH VOICE GATEWAY B k C 
MAINTAINS USER 300 & 600 CAPABIUHES ASSOCIATED WITH THE CAU. 

X 



USER 300 SENDS A VOICE PACKET TO IP MIXER 201 WHICH IN 
TURN SENDS IT TO IP MIXER 201 IN VOICE GATEWAY 201 
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FIG, 7 
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(54) Wan-based voice gateway 

(57) In one embodiment of the invention, an appa- 
ratus is provided for establishing a communication ses- 
sion between first and second terminals In communica- 
tion over a plurality of networks that employ differing 
transmission standards. The plurality of networks are 
selected from among a circuit switched network (e.g., a 
telephony network), a connectionless packet switched 
network (e.g., the Internet) and a connection-oriented 
packet switched network (e.g., an ATM or frame relay 
network). The apparatus includes a call set-up translator 
for translating among call set-up protocols associated 
with the circuit switched network, the connectionless 
packet switched network and the connection-oriented 
packet switched network. An encoding format translator 
is provided for translating among encoding protocols as- 
sociated with the circuit switched network, the connec- 
tionless packet switched network and the connection- 
oriented packet switched network. Also provided is an 
address database for storing a plurality of addresses in 
different formats for each registered terminal, which in- 
cludes the first and second terminals. The apparatus al- 
so includes a session manager for storing control infor- 
mation relating to the first and second terminals. The 
control information includes an identification of the first 
and second terminals that participate in the communi- 
cation session. 
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